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Information encoding bv shortened Reed-Solomon codes 



The present invention concerns communication systems in which the 
data to be transmitted are subjected to a channel encoding in order to improve 
5 the fidelity of the transmission. It concerns more particularly decoding methods 
as well as the devices and apparatuses adapted to implement those methods. 

It will be recalled that so-called "channel" encoding consists, when 
the "codewords" sent to the receiver are formed, of introducing a certain amount 
of redundancy in the data to be transmitted. More particularly, by means of each 
10 codeword, the information is transmitted that is initially contained in a 
predetermined number k of symbols taken from an "alphabet" of finite size q ; 
on the basis of these k information symbols, calculation is made of a number 
n > k of symbols belonging to that alphabet, which constitute the components 
of the codewords: vs(v 0 ,v lf ....v,^) (the symbol "s" means "by definition"). 

15 The set of codewords obtained when each information symbol takes some 

value in the alphabet constitutes a sort of dictionary referred to as a "code" of 

"dimension" k and "length" n . 

In particular, certain codes, termed "linear codes" are such that any 

linear combination of codewords (with the coefficients taken from the alphabet) 
20 is still a codeword. These codes may conveniently be associated with a matrix 

H of dimension (n-k)xn, termed "parity matrix": a word v of given length n is 

a codeword if, and only if, it satisfies the relationship: H ■ v T = 0 (where the 
exponent T indicates the transposition); the code is then said to be "orthogonal" 
to the matrix H . 

25 Among these linear codes, certain have the property of being 

"cyclic": for any given codeword (v 0 ,v lf ...,v„_]), the word (v„_i,v 0 ,vj, ...,v w _ 2 ) 
obtained by cyclic permutation is a word of the same code. The words 
belonging to a cyclic linear code may conveniently be represented by means of 

polynomials: if the polynomial v(x) s ^v,*' is made to correspond to some 
30 word v = (vq.V], ...,v w _i), a given word v belongs to a given cyclic linear code if, 
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and only if, the corresponding polynomial v(x) is a multiple of a certain 

polynomial g(x), of degree (n-k) and divisor of (x n -I), termed "generator 
polynomial" of the cyclic linear code which characterizes it. Moreover a parity 
matrix H can be associated with any generator polynomial g(x). 
5 At the receiver, the decoding method correlative to a given encoding 

method judiciously exploits the redundancy incorporated in the codewords to 
detect any transmission errors and if possible to correct them. There is a 
transmission error if the difference e between a received word r and the 
corresponding codeword v sent by the transmitter is non-zero. 
1 0 More particularly, the decoding is carried out in two main steps. 

The first step consists of associating an "associated codeword" with 
the received word. Conventionally, the decoder first of all calculates the vector 

of "error syndromes" H • r T = H • e T . If the syndromes are all zero, it is assumed 
that no transmission error has occurred, and the "associated code word" will 

15 then simply be taken to be equal to the received word. If that is not the case, it 
is thereby deduced that certain symbols in the received word are erroneous, 
and a correction algorithm is then implemented which is adapted to estimate the 
value of the error e ; the algorithm will thus provide an estimated value | such 
that (r-S) is a codeword, which will then constitute the "associated codeword". 

20 The second step simply consists in reversing the encoding method. 

In the ideal situation in which all the transmission errors have been corrected, 
the initial information symbols are thereby recovered. 

The purpose of an error correction algorithm is to associate with the 
received word the codeword situated at the shortest Hamming distance from 

25 that received word, the "Hamming distance" being, by definition, the number of 
places where two words of the same length have a different symbol. The 
shortest Hamming distance between two different codewords of a code is 
termed the "minimum distance" d of that code. This is an important parameter of 
the code. More particularly, it is in principle possible to find the position of the 

30 possible errors in a received word, and to provide the correct replacement 
symbol (i.e. that is identical to that sent by the transmitter) for each of those 
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positions, each time the number of erroneous positions is at most equal to 
INT((cM)/2] (where "INT" designates the integer part) for a code of minimum 
distance d (for certain error configurations, it is sometimes even possible to 
achieve better). However, in all cases, the concern is not with a possibility in 
5 principle, since it is often difficult to develop a decoding algorithm achieving 
such performance. 

The present invention concerns in particular "Reed-Solomon" codes, 
which are reputed for their efficiency. These are cyclic linear codes, of which the 
minimum distance d is equal to ( n - k + 1 ). In general terms, codes of dimension 

10 k and of length n having a minimum distance d = n-k + \ are termed "MDS" 
(for "Maximum Distance Separable") since they are codes of which the 
minimum Distance allows a Maximum Separation between the codewords, 
account taken of the parameters k and n ). 

When the size q of the "alphabet" is a power of a prime number, the 

1 5 alphabet can be given the structure of what is known as a "Galois field" denoted 
F q , of which the non-zero elements may conveniently be identified as each 

being equal to y' -1 for a corresponding value of /, where / = 1 q-\, and 

where y is a primitive (q -l) th root of unity in F<,. 

The parity matrix H of the Reed-Solomon code of dimension k and 
20 of length n (where n is necessarily equal to (q-\) or a divisor of (#-1)) is a 
matrix with m = n-k lines and n columns, which may be defined for example 
by taking Hy= a (i+l)J (0 s / £ m-\ , 0 £ j £ n-1), where a is an n ,h root of 

m 

unity in F q . The generator polynomial of this code is g(x) = Y[( x ~ a ' ) • 

;=1 

Among the algorithms known for encoding a sequence 
25 a = {a 0 ,a { , ...,a k _ x ) of information symbols belonging to F Q by means of a Reed- 
Solomon code, certain use that parity matrix H. In these algorithms, a certain 
relationship is chosen between the information symbols and those of the 
corresponding codeword v (for example v,- = or, for 0 <, i < k - 1 ; in this case, 

the encoding is said to be "systematic"). Next the components of v remaining to 
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be determined are calculated using the matrix equation H v T =0 it is thus 
necessary to solve (n-k) linear equations on F q , which constitutes a relatively 
complex calculation. Other encoding methods use the divisibility of the 
polynomial v(x) by the polynomial generator g(x)\ this is in particular the case 
5 of the so-called "Euclidean" algorithm, of which the operation will be described 
in detail below, and which requires calculations which are appreciably less 
complex than the algorithms just mentioned using the parity matrix H. 

To decode Reed-Solomon codes, a so-called "Berlekamp-Massey" 
algorithm is usually employed for the detection of the erroneous positions in a 

10 received word, and a so-called "Forney" algorithm for the correction of the 
corresponding erroneous symbols. For more details on Reed-Solomon codes, 
reference may for example be made to the work by R.E. Blahut entitled "Theory 
and practice of error-control codes", Addison-Wesley, Reading, Mass., 1983). 

Like all codes, Reed-Solomon codes may be "shortened". It is said 

15 that a code C of length n , is a "shortened" version of a code C* of length ri , if it 
comprises solely the words v; of C of which, for a number R of predetermined 
positions, the components are all zero: as these positions are known to the 
receiver, their transmission can be obviated, such that the length of the 
shortened code is n = n'-R . When C is an "MDS" code as defined above, this 

20 amounts to encoding information sequences of reduced length 
k = k'-R = k'-n'+n (where k' designates the dimension of the code C) by 
means of the shortened code C, while however keeping the same redundancy 
m = n'-k'= n-k . 

It will be noted that shortened Reed-Solomon codes have recently 
25 increased in importance due to the fact that encoding by means of a new, very 
efficient, type of code, termed "algebraic geometric code" may, in certain cases, 
be conveniently reduced to encoding by means of a plurality of shortened Reed- 
Solomon codes (for an introduction to algebraic geometric codes, reference 
may for example be made to the article by J.H. van Lint entitled "Algebraic 
30 Geometric Codes" in "Coding Theory and Design Theory" 1st part, IMA 
Volumes Math. Appl., volume 21, Springer-Verlag, Berlin, 1990). An encoding 
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method of that type is disclosed in the application FR-0301546 in the name of 
CANON. 

In will be noted in passing that when it is intended to delete certain 
components of the codewords, it matters little if the fixed values of those 
5 components be all zero; they may possibly each have some predetermined 
value. 

Clearly, a shortened Reed-Solomon code is still a linear code, but it 
is no longer a cyclic code. In order to form, on the basis of a sequence of 
information, a codeword belonging to a shortened Reed-Solomon code, it is 
10 naturally possible to implement an algorithm utilizing the parity matrix H' of the 
code H\ it suffices to assign the value zero (or a predetermined respective 
constant) to said predetermined components of v| in the equations represented 

by H' v' T = Q. However, as has been stated, that algorithm utilizing the parity 
matrix necessitates complex calculations. 

15 It would thus be desirable to be able to perform that encoding using a 

less complex algorithm, for example the Euclidean algorithm. Unfortunately, that 
algorithm does not enable a predetermined value (even zero) to be imposed in 
a natural manner on certain predetermined components of the codeword to be 
calculated. Without entering into details, it will simply be noted here that certain 

20 manipulations of polynomials are known making it possible to adapt the 
Euclidean algorithm to meet that requirement, but solely in the case in which 
said predetermined positions of the word v| are consecutive. This is however 
insufficient for example in the case of applications to the geometric algebraic 
codes concerned in the application FR-0301546 mentioned above, in which the 

25 components deleted by virtue of the shortening of a Reed-Solomon code are 
not necessarily consecutive. 

According to another point of view, it is known to "puncture" the 
codewords at transmission: puncturing consists of dispensing with the 
transmission of certain predetermined components, whatever the value (not 

30 predetermined in this case) of those components. The drawback of puncturing 
with respect to shortening is that the components deleted by puncturing include 
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a portion of the redundancy provided by the channel encoding. An irrecoverable 
loss of that redundancy thus ensues. 

Thus, according to a first aspect, the invention relates to a method of 
encoding over a Galois field F,, where g is an integer greater than 2 and equal 

5 to a power of a prime number, in which a word y'=(v'o. v, i • ••••v'„'-i) is 
calculated, where belonging to an "MDS" linear cyclic code C of 

dimension (n'-m), where \<,m<>ri-2, on the basis of a word 

a = {a Q ,a x a n _ m _\) of information symbols, where m <n<ri. A set 

s = (s 0 ,s\ s„_i) of strictly increasing integers, with s 0 ZO and s„_\ S/t'-l, 

10 having been predetermined, said method comprises the following steps: 

n-\ 

a) forming the polynomial a(x)& ^ai_ m x s ' , 

i=m 

b) calculating the remainder r{x) of the Euclidean division of a(x) by 

m 

the polynomial g(x) = ^g p x p generating said code C, 

p=Q 

n'-\ 

c) calculating the polynomial v * (x) = a(x) - r(x) ■ ^ v */ *' - 

/=o 

15 corresponding to the word v* = (v* 0 ,v* lf ...,v*„'_|), and 

d) taking v'= v* if s m _^ =m-\; othenwise obtaining said word v' by 

taking: 

s m -m-l 

in which the words r y of length «* are defined by: r J i = for y ^ / < y + m , 

20 and r 7 / = 0 othenwise, and in which the elements fj of F q are calculated by 
means of the equations (1) in which, for the (s m -m) values of i<s m not 
belonging to the set s, each component v', is taken equal to a respective 
predetermined constant. 
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Thus, according to the invention, encoding is performed of any 
information word a of length k in the form of a word v' belonging to a Reed- 
Solomon code C of dimension k % and length ri (with * - k ) such that the 
components of v f situated in (ri-n) arbitrary predetermined positions be 
5 systematically equal to respective predetermined constants (for example, all 
zero). 

It will be noted that the method according to the invention is 
advantageously simple to implement. This is because on the one hand the 
method utilizes the Euclidean algorithm which is, as has been stated, of low 
10 complexity in comparison to an encoding on C explicitly relying on a parity 
matrix H\ On the other hand, the method according to the invention comprises 
solving a system of linear equations having the coefficients fj as unknowns; 

however, as will be seen below on the basis of example embodiments, the 
dimension of that system of equations is generally only a few units. 

15 Further to the steps succinctly stated above, it will naturally be 

possible to delete those components of fixed value to obtain a word v of length 
n belonging to a code C, which thus constitutes a code that is shortened with 
respect to code C\ By virtue of the invention, as the components of 
predetermined positions take, after encoding in C\ a value fixed in advance, 

20 their deletion to form words of the shortened code C will not lead to any loss in 
redundancy. 

For certain applications, it may be worthwhile to "modify" code C. A 
code C 8 is a "modified" version of a code C of length n if there is a square non- 
singular diagonal nxn matrix B such that each word t/ 8 of C s is equal to vJS 
25 with v in C. Therefore in the case in which it is desired to encode information 
symbols by means of words belonging to code C 8 rather than to code C, the 
following procedure will be followed: 

Let p, denote the element of B in position (/,/). Said procedure then 

comprises: 

30 - constructing, on the basis of the information symbols, the polynomial 
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a fl (*) H "f>_J*jy' . 

- implementing steps b), c) and d) of the method according to the 
invention, by replacing a(x) with a B (x), which gives a word v %B , 

- deleting the components of predetermined value from v tB , which gives 
5 a word v B = (v*o,v*i v*„-i), and 

- calculating the word v = (v 0 ,v lf ...,v w .,) defined by: v, = v B i $ i for all / 
from 0 to(w-l). 

According to particular features of the invention, ri is equal to (q-l) 

m 

or is a divisor of and g(x) = f|(x-a'), where a is an element of F q 

10 satisfying a" = 1. In this case, the code C is thus a Reed-Solomon code, and C 
is a shortened Reed-Solomon code. This particular case of "MDS" linear cyclic 
code may in particular be useful for encoding by means of a complex code of 
which at least the encoding or the decoding may be reduced, in a certain 
manner, to the encoding or decoding of at least one shortened Reed-Solomon 

15 code. The present invention may thus, for example, be applied to the method 
disclosed in application FR-0301546, mentioned above, in which said complex 
code is an algebraic geometric code. 

More generally, still according to the same first aspect, the invention 
relates to a method of encoding for algebraic geometric codes, comprising at 

20 least one step in which codewords belonging to a shortened "MDS" linear cyclic 
code are calculated, said method being remarkable in that said calculation is 
performed by means of any one of the encoding methods succinctly described 
above. 

According to a second aspect, the invention relates to a device for 
25 encoding over a Galois field F g , where q is an integer greater than 2 and equal 

to a power of a prime number, in which a word v , = (v , 0 ,v , ll ...,v , w »_ 1 ) is 

calculated, where w*>3, belonging to an "MDS" linear cyclic code C of 
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dimension (ri-m), where l£m£ri-2, on the basis of a word 

a = (fl 0 ,a, of information symbols, where m<n<ri. A set 

s = {sq,s\ j„_j) of strictly increasing integers, with s 0 z0 and s„_ { Sn'-l, 

having been predetermined, said device is adapted to : 

5 a) form the polynomial a(x)s , 

b) calculate the remainder r(x) of the Euclidean division of a(x) by the 

m 

polynomial g(x) = ^g p x p generating said code C, 

n'-\ 

c) calculate the polynomial v * (x) = a(x) - r(x) ■ £ v *, *' , 

i=0 

corresponding to the word v* = (v*o, v *i. ...,v*„--i). and 
10 d) take v'= v* if s m _ { =m-\; otherwise to obtain said word v' by 

taking: 

v'=v*+ £ fjTj , (1) 

7=0 

in which the words r J of length ri are defined by: T J i = g,_y for j<iz j + m, 

and r y i = 0 otherwise, and in which the elements fj of F q are calculated by 
15 means of the equations (1) in which, for the (s m -m) values of i<s m not 
belonging to the set s, each component v', is taken equal to a respective 
predetermined constant. 

According to particular features, ri is equal to (q - 1) or is a divisor of 

m 

(?-l),and g(x) = !"[(*- a'), where a is an element of F q satisfying a" =1. 
/=i 

20 According to a third aspect, the invention also relates to various 

apparatuses. 
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Thus, firstly, the invention concerns an apparatus for processing data 
comprising a source of information symbols, said apparatus being remarkable in 
that it further comprises: 

- a storage unit adapted to accumulate said symbols so as to form 
5 codewords a each containing a predetermined number k of symbols, 

- an encoding device as described succinctly above, and 

- a transmitter adapted to transmit the words v* resulting from the 
encoding of said information symbols. 

Secondly, the invention concerns an apparatus for processing data 
comprising a source of information symbols, said apparatus being remarkable in 
that it further comprises: 

- a storage unit adapted to accumulate said symbols so as to form 
codewords a each containing a predetermined number k of symbols, 

- an encoding device as described succinctly above, 

- a shortening unit adapted to delete said components of predetermined 
value from v' , so as to form a word v of length n , and 

- a transmitter adapted to transmit the words v resulting from the 
encoding of said information symbols. 

The invention also relates to: 

- a non-removable data storage means comprising computer program 
code instructions for the execution of the steps of any one of the encoding 
methods succinctly described above, 

- a partially or wholly removable data storage means comprising 
computer program code instructions for the execution of the steps of any one of 
the encoding methods succinctly described above, and 

- a computer program containing instructions such that, when said 
program controls a programmable data processing device, said instructions 
lead to said data processing device implementing one of the encoding methods 
succinctly described above. 

The advantages of that encoding device, those data processing 
apparatuses, those data storage means and that computer program are 
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essentially the same as those of the corresponding encoding methods 
succinctly described above. 

Other aspects and advantages of the invention will emerge from a 
reading of the following detailed description of particular embodiments, given by 
5 way of non-limiting example. The description refers to the accompanying 
drawings, in which: 

- Figure 1 is a block diagram of a system for encoding information 
symbols according to an embodiment of the invention, and 

- Figure 2 shows a data processing apparatus comprising an encoder 
10 according to the invention; 

Figure 1 is a block diagram of an encoding system according to an 
embodiment of the invention. 

The function of this system is to encode information of any nature 
from a source 100. First of all, the source 100 puts this information into the form 
15 of symbols belonging to a certain alphabet (for example bytes of bits in the case 
in which the size q of the alphabet is 256), and transmits these symbols to a 
storage unit 101, which accumulates the symbols so as to form sets each 
containing k symbols. Next, each of these sets is transmitted by the storage 
unit 101 to an encoding unit 102 which constructs a "pre-encoded" word v' 
20 belonging to a certain Reed-Solomon code (in the strict sense) C*. 

It will now be recalled how the so-called "Euclidean" encoding 
algorithm operates for a cyclic linear code of length n and polynomial generator 
g(x) , of degree m = n-k. 

Firstly, on the basis of the sequence a = (a 0 ,a h ...,a k _ x ) of 

25 information symbols, there is formed the polynomial 

/7-1 



Next a(x) is divided by g(x), which is written: 
a(x) = q(x)g(x) + r(x) , 
where q(x) designates the quotient, and in which the degree of the remainder 
30 r(x) is strictly less than m . If we then take 
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v(x)»a(x)-r(x) , 

it can be seen that v(x) is a multiple of g(x) , and thus truly represents a word v 
belonging to the code. Explicitly, if we write 

m-l 

r(x) a £ r iX ' , 
<=0 

5 the codeword obtained by this Euclidean algorithm is 

v = ( -r o - r m-l> a 0' a \ a k-\) • 

Among the components of v so obtained, k are respectively equal to 
components of a : this is thus a case of "systematic" encoding. 

For example, take: q = 2 4 , k = \0, and « = ?-l = l5, i.e. m = 5. 
10 Consider more particularly the Galois field in which a primitive element y 
satisfies y 4 + y + 1 = 0 . The polynomial generator is 

5 

g(*) = n(*-Y') = * 5+ Y 7 * 4 +Y 2 * 3 +y 5 x 2 +yx + l . ( 2 ) 
i=l 

Which divides (x 15 -l). 

Let for example 
15 a = (y 4 ,0,y 7 ,1,0, 0,0,0,0,0 ) , 

corresponding to the polynomial 
a(x) = x s +y 7 x 7 +y 4 x 5 . 
be an information sequence to be encoded. 

The Euclidean division of a(x) by g(x) then respectively gives a 
20 quotient and a remainder equal to 

q(x) = x 3 +y 2 x + y n , r(x) = yx 4 +y 4 x 2 +y 6 x 2 +y ,4 jc + y 12 . 

Finally (noting that on F 16 , the signs + and - are equivalent), we 

obtain 

v(x) m x s + y 1 x 1 + y 4 x 5 + yx 4 + y 4 x 3 + y 6 x 2 + y 14 x + y 12 , that is 
25 v = (y 12 ,Y 14 ,y 6 .y 4 ,y,y 4 I 0,y 7 ,l,0,0,0,0,0,o) . 
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If, instead, it had been desired to calculate v by solving H • v T = 0 , 
this would have been performed working with the parity matrix 

Y Y 2 Y 3 Y 4 Y 5 Y 6 Y ? Y* Y 9 y'° Y M Y 12 Y* 3 Y H 
y 2 y 4 y 6 y 8 y l0 y l2 y l4 y y 3 y 5 y 7 y 9 y 1l y 13 

//= . y 3 / y 9 y l2 , y 3 y 6 y 9 y l2 , y 3 y 6 y 9 y 12 

Y 4 Y 8 Y 12 Y Y 5 Y 9 Y 13 Y 2 Y 6 y'° y' 4 Y 3 Y ? Y 11 
LI Y 5 Y 10 I Y 5 Y 10 1 Y 5 y'° 1 Y 5 Y 10 I Y 5 y' 0 

which clearly implies burdensome calculations. Thanks to this example, it can thus 
5 be seen that the Euclidean algorithm provides the codeword sought in a much less 
complex manner than by using the parity matrix of the code. 

The present invention makes it possible to take advantage of that 
simplicity by judiciously adapting the Euclidean algorithm to the case of 
shortened codes, whatever the positions of the components of the codewords v' 
10 (of length ri) intended to be conserved in the shortened codewords v (of length 
n ). The set of these positions will be denoted 
s = (sq,S\, ...,s„_i) , 

in which the integers s, ( 0 < / < « - 1 ) are strictly increasing, and naturally s 0 z0, 
j„_, In the context of the invention, an "MDS" cyclic linear code, as 

15 defined above, is designated by C and the code obtained by shortening the 
code C is designated by C. 

More particularly, the method according to the invention performs a 

"systematic" encoding of any information sequence a = (a 0 ,fl, a k _{). It 

comprises the following main steps. 

n-\ 

20 In a first step, forming the polynomial a{x)= ^^/-/n^ 4 ' (where 

i=m 

m s ri-k'= n-k) on the basis of the information sequence a . 

In a second step, calculating the remainder r(x) of the Euclidean 
division of a(x) by g(x), which is the polynomial generator of C. As g(x) is of 
degree m , the degree of r(x) is at most equal to (m -1) . 
25 In a third step, calculating the polynomial 
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n'-l 

v*(x) = a(x)-r(x)a Y,v*! x' , 

/=0 

corresponding to the word v* = (v* 0 ,v*|, ...,v*„._,) belonging to C. It should be 
noted that, by construction, the contribution of a(x) to v*(x) only contains 
powers of x belonging to s (the smallest being greater than or equal to s m ). 

5 The two main cases are then to be envisaged for the fourth step of 

the method according to the invention. 

If the m first integers of s comprise all the successive integers from 
0 to (m-1), then the contribution of r(x) to v*(x) only contains powers of x 
belonging to s (the largest being less than or equal to j ot _, =m-l). In this 
1 0 case, it suffices to take v'= v * . 

On the other hand, if there is at least one "hole" in the succession of 
the m first integers of s (and thus j ot _, >m-l, s m >m, and so forth), 

calculation will now be made of a word v' satisfying: 
• v 'i = v */ f° r s m^i^ (contribution of a(x) ), and 
15 • v'j^Cj, in which the c, are predetermined constants, for all i<s m not 

belonging to s , that is to say different from s 0 , s { s m _ { ; these values of 

/ are thus (s m -m) in number. 

It will be noted that v*,=0 for m<>i<(s m -\)\ but, in the case 
considered in which s m _ x >m-\, v', = 0 will not necessarily be satisfied for 
20 those indices, even when c, = 0 . This is because, according to the invention, a 
word v' belonging to C is obtained by linear combination of v * with e , where 
e = s m -m, judiciously chosen words tJ belonging to C: 

v'=v* + Z//r; , (1) 

7=0 

in which the coefficients fj belong to F<,. Explicitly, this means: 
25 • for s m <L i n'-l : v'^v*,-, (1a) 
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s m -m-\ 

• forms/ <s m : v',= £ //^/ , and (1b) 

• forO<;/<m : v', = v*,+ £ //^/ . (1c) 

y=o 

The words W are constructed from the coefficients of 

m-l 

S(x) ■ X g p x p . More particularly, the words T J are defined as follows- 

p=0 

5 for j <: i < j + m : T J i = , and (3a) 

- T J i f = 0 otherwise. (3^ 
Explicitly: 

E° =(go-gi Sm.O 0) . 

E 1 =(0,g 0 .gh.. ,g m ,0,...,0) , 



10 

r e_1 = (0,...,0,g 0 ..gi Sm.0, •.<)), in which the index of the last non-zero 

component is / = s m - 1 . 

The polynomial corresponding to the word r° being identical to 
g(x), it is of course divisible by g(x), and r° thus belongs to C\ The other 
15 words r J just being circular permutations of r° , they also belong to C 

It then only remains to find the e coefficients of fj which satisfy the 
system of e equations concerned (in which it is stipulated that v', = c, , as 
mentioned above) among the ri linear equations represented by equation (1). It 
can be shown that, with the definitions (3a-3b), the determinant of that system is 
20 non-zero whatever said equations concerned are; this property is linked to the 
"MDS" character of the code C\ 

The method described above will now be illustrated by means of a 
numerical example. 
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Take, for example, for code C, the one which served above by way 
of example, to illustrate the Euclidean algorithm (q = 2 4 , *' = 1 0 , n' = $ - 1 = 1 5 , 
m = 5 , and g(x) given by equation (2)). It is desired to shorten it to construct a 
code C of dimension k = 5 and length n = 10; more specifically, suppose that it 
5 is wished to obtain v* t = 0 for / not belonging to 

£ = (1,2,4,5,7,8,10,1 1,13,14) . 
Thus, s m = 8 , and e = 8 - 5 = 3 , and the three indices in question (that is to say 
not belonging to s, and less than s m =8) are: 0, 3 and 6. This then gives: 

r 0 =(l,Y,y 5 .y 2 ,y 7 ,l,0,...,0) , 

10 E 1 =(0,l,y,y 5 ,y 2 ,y 7 ,l,0 0) , 

T 2 =(0,0,l,y,y 5 ,y 2 ,y 7 ,l,0 0) , 

and thus, in particular: 

r°o=o,r 0 3=y 2 ,r 0 6=o, 

r'o=o, r , 3=y 5 , ^6=1, and 

15 r 2 0 =o, r 2 3 =y, r 2 6=y 7 . 

Let for example the following information sequence be encoded 
2 = (Y 9 .0,y ,, .0,o). 
The following polynomial is then made to correspond to it 
a(x) = Y ll jc M +y 9 x 8 , 
20 and Euclidean division by g(x) gives: 

4(x) = Y n x 6 +yV+Y 9 x 4 +Y 6 x 3 +y ,4 x 2 +y 13 +Y 8 , and 
r(x) = yx 4 +yx 3 +y ]3 x 2 +y ,0 x + y 8 . 
Consequently, 

v'W = y"x ll + yV +yx 4 +yx 3 +y ,3 x 2 +y ,0 x + y 8 , 
25 and thus: v * 0 = y 8 , v * 3 = y (naturally v * 6 = 0 ). 

Solving the 3 equations (1) then gives: 
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/ 0 =Y 8 ./l=Y 2 ./2=Y ,() . 

Finally, the following is obtained: 

v^v'+yV+Y^ 1 +Y ,0 r 2 -(oy 4 i y l2 i O l l,0,0 f y ,o i y 9 i O i O ( Y ,, AO i o). 
In this embodiment, once the encoding is terminated, the encoding 
5 unit 102 transmits the "pre-encoded" words v' to a shortening unit 20, which 

deletes the components of v' of which the index does not belong to the set s . 
Thus the words v belonging to the shortened code C are obtained. 

Thus, in the numerical example which has just been considered, the 
following is obtained: 

10 v = (y 14 ,y 12 ,iay ,0 .y 9 Ay ,, .o > o). 

It is clear, in view of the above account, that the lower the value of e , 
the lesser will be the calculations implied by the method according to the 
invention. It will now be shown how it is possible to minimize the value of e 
according to a refinement to the invention, for given s . 

15 As was shown above, the value of e is linked to the existence of 

"holes" in a succession of m consecutive elements of s. The refinement in 
question thus consists of searching in s for the sequence of m consecutive 
elements having the least "holes", and of bringing that sequence to the beginning 
of the words of the code C\ taking advantage of the cyclic character of that code. 

20 This strategy will be better understood with the aid of a numerical 

example. Let us resume the example in which the polynomial generator is given by 
equation (2), but this time a set of predetermined positions will be considered 
which is given by: 

s = (1,4,6,7,8,9,1 1,12,13,14) . 

25 Here s m = s 5 = 9 , and thus the strict application of the method 

described above leads to forming the following polynomial from an information 
sequence a of length 5: 

a(x) = oqx 9 + a x x x 1 + a 2 x n + a 3 x 13 + a 4 x X4 . 

Euclidean division makes it possible to construct the word 
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v* = (-r 0 , -n,-r2.-r},-r 4 ,0,O,0,0 l a o ,0 l ai. o 2 ,a 3 ,a 4 ) , 
in which it is proposed, for example, to transform the e = 4 components v* 0 , 
v* 2 , v* 3 , and v* 5 into constants which are all zero in the manner taught by 
the invention. 

5 It will now be shown that there is another manner of performing the 

calculations, which is appreciably less complex since it leads to a lower value of 

e , i.e. e* = 1 . 

Indeed, it is noted that, in s , the sequence of 5 consecutive positions 
8,9,11,12,13 

10 has only a single "hole", i.e. position No. 10. Thus, if on all the words of code C, 
a circular permutation of 8 positions to the left is performed, the initial position of 
that sequence, i.e. position No. 8, is brought to initial position No. 0. Overall, s 
is thus transformed into 

s* = (0,1,3,4,5,6,8,1 1,13,14) , 

15 recalling that the positions of components are defined modulo «' = 15. Thus, in 
£*, the sole "missing" position below s* m =s* 5 =6 is position No. 2, which 
corresponds indeed to e* = 1 . 

In polynomial "language", this circular permutation of 8 positions to 
the left corresponds to a multiplication by x~* modulo (x 15 -1). Now make the 

20 following polynomial correspond to the information sequence a : 

o(x) = a 0 x + a\ x 4 + a 2 x 6 + a 3 x 7 + a 4 x X4 , 
and define 

a * (x) m x -*a(x) = a 4 x 6 + a 0 x % + a,* 1 1 + a 2 x 13 + a 3 x 14 . 
The Euclidean division of a * (x) by g(x) then gives a remainder 

25 r*(x) = r* 4 x 4 +r* 3 x 3 +r* 2 x 2 +r*, x + r* 0 . 

It is thereby deduced that v * (x) = a * (x) - r * (x) , hence 

= (- r * 0 - ~ r * 1 ~ r *2 -r * 3 ~r * 4 . 0, a 4 ,0, a 0 ,0,0, a, ,0, a 2 , a 3 ) . 
In this word v * , solely the component v * 2 = -r * 2 is both non-zero 
(in general) and absent from s*. The technique taught according to the 
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invention is next implemented, now with e* = 1, to deduce from v* a codeword 
v* having v' 2 = 0 . 

It remains nevertheless to apply a circular permutation of 8 positions 
to the components of this word v', this time to the right, to get back to the 
5 positions prescribed by s . In this way a word of C is obtained which is the final 
result of the encoding according to the invention of the information sequence a . 

The words v issuing from the shortening unit 20 are finally 
transmitted by the transmission unit 103 to a predetermined recipient. This 
recipient may for example form part of a complex encoding system (for example 
10 relying on a multiplicity of shortened Reed-Solomon codes). This recipient may, 
according to another example, be a transmission chain comprising a modulator, 
which associates a modulation symbol with each predetermined number of 
binary symbols (bits), followed by a recorder or else (respectively) by a 
transmitter inserting the symbols in a transmission channel, that channel for 
15 example able to be a storage on a suitable carrier (such as a DVD, or a 
magnetic or magnetico-optical disc, or else magnetic tape), or (respectively) 
transmission by wire or wireless transmission (such as a radio link). 

The block diagram of Figure 2 represents, very schematically, a data 
processing apparatus 48 incorporating the encoder 102. 
20 This apparatus 48 comprises a keyboard 911, a screen 909, a 

source of external information 100, a transmitter 103, conjointly connected to 
input/output ports 903 of an encoding device 102 which is implemented here in 
the form of a logic unit. 

The encoding device 102 comprises, connected together by an 
25 address and data bus 902: 

- a central processing unit 900, 

- a random access memory RAM 904, 

- a read only memory 905, and 

- said input/output ports 903. 

30 Each of the elements illustrated in Figure 2 is well known to a person 

skilled in the art of microcomputers and transmission systems and, more 
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generally, of information processing systems. These known elements are 
therefore not described here. It should be noted, however, that: 

- the information source 100 could, for example, be an interface 
peripheral, a sensor, a demodulator, an external memory or other information 

5 processing system (not shown), and could for example supply sequences of 
signals representing speech, service messages or multimedia data in particular 
of IP or ATM type, in the form of sequences of binary data, and 

- the transmitter 103 is adapted to transmit the words belonging to code 
C, for example to a unit belonging to a complex encoding system, or to a device 

10 for sending on a radio channel or for recording on a carrier for mass storage. 

The random access memory 904 stores data, variables and 
intermediate processing results, in memory registers bearing, in the description, 
the same names as the data whose values they store. It should be noted, in 
passing, that the word "register" designates, throughout the present description, 
15 a memory area of low capacity (a few items of binary data) and equally well a 
memory area of high capacity (for storing a complete program) within a random 
access memory or read only memory. 

The random access memory 904 contains in particular the following 

registers: 

20 - a register "information_symbols" in which the information symbols 

belonging to F q are stored, 

- a register "pre-encoded_word$", in which the words v' belonging to 
the non-shortened code are stored, and 

- a register "code_words", in which the words v belonging to the 
25 shortened code are stored, before being submitted to the transmitter 103. 

The read only memory 905 is adapted to store, in registers which, for 
convenience, have the same names as the data which they store: 

- the operating program of the central processing unit 900, in a register 
"program", 

- the length ri of the words belonging to the non-shortened code, in a 
register"//", 
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- the length n of the words belonging to the shortened code, In a 
register "n", 

- the set j of the positions of the components to keep after shortening, 
in a register V\ 

5 - the cardinal of the Galois field F 9 serving as alphabet for the code 

used, in a register "cf\ 

- the number k = n-m of information symbols serving to construct a 
codeword, in a register "/<", and 

- the coefficients of the polynomial generator g(x) of the non-shortened 
1 0 code, in a register "g". 



